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Abstract 

We give an estimate of the number of geometrically distinct open tours G for a knight 
on a chessboard. We use a randomization of Warnsdorff rule to implement importance 
sampling in a backtracking scheme, correcting the observed bias of the original rule, 
according to the proposed principle that "most solutions follow Warnsdorff rule most of 
the time". After some experiments in order to test this principle, and to calibrate a 
parameter, interpreted as a distance of a general solution from a Warnsdorff solution, we 
conjecture that G = 1.22 x 10^^. 
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1 Introduction 



In this paper we give an estimate of the number of open Knight Tours in a chessboard, based on 
the heuristics given by the classical Warnsdorff rule, used to implement importance sampling 
in a backtracking algorithm, following [Knuth 1975]. The historical development of the Knight 
Tour problem is beyond the possibilities of the present work, and the interested reader is 
suggested to visit George Jelliss's web page and the references therein [Jelliss 2003], from where 
we borrow our notation. Let us define precisely the quantities we are interested in: 

• G is the number of Geometrically distinct open tours. 

• T is the number of open Tour diagrams, by rotation and symmetry T = 8 G. 

• N is the number of open tour Numberings: N = 2 T = 16 G since each can be numbered 
from either end. 

• D is the number of closed tour Diagrams. 

These parameters refer to the standard 8x8 chessboard. When other sizes of (squared) 
chessboards are considered, we include a subscript in our notation, for instance Ng is the 
number of open tour numberings in a 6 x 6 chessboard. 

In the terms just introduced our purpose is to give an estimate of G. A relevant exact 
computation related to this problem was carried out independently in [McKay 1997] and in 
[Wegener 2000], who computed the number D = 13,267,364,410,532. 

2 Importance sampling through Warnsdorff rule 

Warnsdorff rule was proposed in 1823 in order to find open tours in a chessboard. It states: "at 
each step, choose the square with the lowest number of possible continuations; if a tie occurs it 
may be broken arbitrarily" [Warnsdorff 1823]. This heuristic rule is incredibly effective to find 
solutions. Only 150 years after being formulated, with the help of a computer, it was reported 
that the rule can fail, in respect to its second part [Guik 1983]. Modern numerical experiments 
showed that this rule is biased, in the sense that it founds solutions in an extremely small subset 
of the set of solutions, fact that we confirm. 

Our departure point in order to estimate the total amount of open tours, is the principle 
that "most solutions follow Warnsdorff rule most of the time", i.e. a typical solution is a 
concatenation of short paths satisfying the rule within them, that, when concatenated violate 
the rule. According to this principle we decompose the number G as the sum G = Go + - ■ ■ + G62 
where we denote by G^ (A; = 0, 1, ... , 62) the number of geometrically distinct knight tours that 
violates the rule exactly k times. In particular Go is the number of knight tours that follows 
Warnsdorff rule (including any of the possibilities in case of tie). In section 0] we provide some 
empirical evidence supporting this principle, giving an estimation of G^ for different values of 
k. We obtain the largest number when k = 13, and significant numbers in a range from k = 5 
tok = 20. 
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In consequence, following [Knuth 1975] and these considerations, we implement a random- 
ized backtracking algorithm, that we call the "Randomized Warnsdorff Rule". We begin by 
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Figure 1: Relevant initial squares 



fixing the ten geometrically different initial squares (see Figure |21) and, at step n, we assign 
probabilities p{n,a,i{n),j) to a jump from the current square i{n) to each connected square 
j, proportional to Nj'" where Nj is the number of free accessible squares from j and a is an 
Importance Sampling (IS) parameter. The estimation of the number of tours for each run is 

rinLi ct) i{n),i{n + 1)) ^ j when the algorithm finds the solution {i{n)}, 

when the algorithm does not complete a tour. 

This is an unbiased estimate of the number of tours beginning at i(l), for all positive probability 
assignations [Knuth 1975]. The problem is then to choose the probabilities p(n, a, i,j^ in order 
to achieve the best possible variance reduction, and here we use Warnsdorff heuristics. 

Two important cases of our scheme are a = 0, corresponding to plain Monte Carlo simu- 
lation in the graph, and a = oo, that gives maximum (equal) probability to the nodes with 
minor amount of free nodes, and null probability to the others, i.e. the original Warnsdorff rule 
with equiprobability in case of tie, giving a biased estimate. 

The first step of our work consists in calibrating the parameter a, that measures the dis- 
tance form a generic solution from a Warnsdorff solution, running our randomized Warnsdorff 
algorithm to estimate three known quantities, N5, Dg and D. It is important to note (and it 
has been noticed in the literature) that the optimal a is sensible to the size of the chessboard. 
We report the results in Section El 

The second step, once we calibrate a = 1.5, consists in running a relatively long sampling 
experiment to obtain our final estimate of G, presented in Section We use this run also to 
compute the results presented in Section HI 

3 Parameter tuning 

In this section we present the results of some numerical experiments that were designed in order 
to validate the proposed method and at the same time to tune a, the importance sampling 
parameter. 
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We estimate three different Icnown quantities using the randomized Warnsdorff rule: N5 = 
1728, obtained in [Planclc 1908] and [Planck 1909], Dg = 98626 obtained in [Duby 1964], and 
the McKay- Wegener number D. 

To define the experimental design, it is necessary to fix a certain number of parameters, 
in particular the sampling size for the Monte Carlo experiments, the number of replications 
with different seeds, and the importance sampling parameter a. The two first parameters were 
fixed taking into account some preliminary experiments and the need to complete work in a 
reasonable computing time; the sampling size was then fixed to 5 x 10^ for the 5x5 board 
and to 10^ for the other two cases, and 21 replications were made with different seeds. Even if 
a is a continuous parameter, it is necessary to decide on a (small) set of values for employing 
in the experiments. It is important to include the case where there is no IS at all [a = 0), 
as well as a relatively wide range of values where the method has different behaviors. We 
also included a = —1, which corresponds to an IS scheme which favors tours "far away from 
Warnsdorff rule" . Based on the same preliminary experiments, we choose to take a in the set 
{ — 1, 0, 0.5, 1, 1.5, 2, 2.5, 3, 4, 5, 8, 10}. The total number of runs is then 756 (three problems, 12 
different values for a, 21 independent replications with different seeds for each of these). The 
Monte Carlo method was programmed using C (compiled with gcc 3.3.3 and 3.4.3), and all 
the experiments were run on an personal computer (CPU 1.70GHz, RAM 224508 kB) running 
Linux. Each experiment took in the order of 100 CPU seconds on the 5x5 board, 2000 CPU 
seconds on the 6x6 board and 15000 seconds on 8 x 8 boards (as reported by the times () 
function of the ' 'sys/times .h' ' library). A summary of the results obtained is shown in 
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Table 1: Quality of estimations of N5=1728 as function of the IS parameter a. 

Tables El and El respectively for the three problems studied. Given a problem, for instance, 
the estimation of N5, the number of open tours numberings in a 5 x 5 chessboard, and a value 
for a, we have i = 1, . . . , 21 independent replications, each returning an estimation numbered 
through N5(z), and an estimation a{i) of the standard deviation (i.e. the square root of the 
variance) of this z-th estimation. The final estimation, the mean in i, is denoted N5. Each table 
has five columns, the first one corresponding to the a values, and the following ones showing 
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the minimum, the estimation, and maximum values for the 21 estimations, an estimation a 
of the standard deviation of our estimate, and the mean of the execution times. The same 
analysis is performed with Dg and D. 

It comes as no surprise to observe that the first two problems are rather easy, and a wide 
range of a values result in estimations quite near to the exact value. It is clear that the variance 
of the estimation depends strongly on a; in both cases the best value is observed when a — 2 
(but a = 1.5 and a = 2.5 come very close). High values for a seem to greatly affect the 
precision of the method (this can be specially be seen in the last two lines for the 6x6 board, 
where the estimations are quite bad). Also, a = —1 leads to inconsistent results. 

The standard (no importance samphng) Monte Carlo results can be seen in the a — line. 
In both cases the standard method gives quite consistent estimates for the number of tours, 
but using Importance Sampling we can attain a variance orders of magnitude smaller. 

From these experimental results, it can be seen that in the three cases under study the pro- 
posed method works well and is more robust and more efficient than the standard Monte Carlo, 
which in some cases does not give meaningful estimations of the desired parameters. Fixing 
a can be a problem, as the method is clearly dependent on this parameter, but fortunately it 
seems as if the choice of a around 1.5 or 2 is rather robust, as those are the values with best 
results in these three problems. As we want to estimate in the bigger chessboard, we decide 
ourselves for a = 1.5. Computing the number of closed tour diagrams on the 8x8 board is 
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Table 2: Quality of estimations of Dg = 9862 as function of the IS parameter a. 

a more demanding problem, and the standard (no importance sampling) method can not find 
good estimates (nor can do it the method with a = —1). In general, the IS estimates quality 
depends heavily on a, and only when 0.5 < a < 3 we find estimation values that approach the 
exact value. Here the setting a = 1.5 leads to the lowest variance. 
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Table 3: Quality of estimations of D = 13, 267, 364, 410, 532 as a function of a. 

4 On the set of open tours 

In this section we present some empirical evidence supporting our departure point, that is, that 
"most solutions follow Warnsdorff rule most of the time" . Firstly, we present in Figure El a 
histogram where we show the estimations of obtained in the final simulation (described in 
next section), in the case and a = 1.5, with 10^ replications. 
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Figure 2: Estimated number of tours Gk for k = 0, ... ,25 [N = 10^). 

In the Table m we present the corresponding values plotted in the Figure |21 It is concluded 
that values from G5 to G20 concentrate the 99.9892% (practically all) solutions, and also that 
Go << G, i.e. the number of Warnsdorff solutions is several orders smaller than the number 
of solutions. 
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Table 4: Estimations of G^ corresponding to Figure |21 



5 On the number G of open tours 

In this section we present our main results, corresponding to the final numerical experiment. In 
this case we perform the randomized Warnsdorff algorithm (as explained in|2)), with a = 1.5, 
as suggested by the experiments exposed in sectional We choose a sample size corresponding 
to 10^ size, as this seems to provide a reasonable result in an acceptable computational time. 
In our case CPU time was 6.5 x 10^ seconds ~ 7.5 days, on a similar personal computer. 

In our runs we distinguish according to the 10 different initial squares (see Figure Ej), and, 
in the four diagonal squares we estimate half of the solutions, based on the symmetry. The 
dynamic nature of Warnsdorff rule does not seem to give a simple way of exploiting the "go 
and return" symmetry, and, in consequence, our crude algorithm gives an estimation of 2G. 
To have an idea of the error of our method we counted also the closed tours, obtaining the 
estimation D=1.326390E+13 (compare with the exact value D=1.3267364410532E+13) with 
a standard deviation (T=3.03E+10. 

Our final results are presented in Table El Based on these final results we construct a 
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Table 5: Estimation of G (sample size lOE+9). 

confidence interval for G. Taking into account that for a normal random variable X with 
mean G and variance a we have P(G — 3a < X < G + 3a) = 0.997, we conclude that 
G G [1.220, 1.225] X 10^^ and from there our main conclusion G = 1.22 x 10^^. 
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